Introduction and explanation
In economics, the Lorenz curve is a graphical representation of the distribution of income or of wealth.
The curve is a graph showing the proportion of overall income or wealth assumed by the bottom \(\mathbf{x\%}\) of the people. It is often used to represent income distribution, where it shows for the bottom \(x\%\) of households, what percentage \(y\%\) of the total income they have.
A perfectly equal income distribution would be in which everyone has the same income. In this case, the bottom \(N\%\) of society would always have \(N\%\) of the income. This can be depicted by the straight line \(y = x\), called the “line of perfect equality”.
By contrast, a perfectly unequal distribution would be one in which one person has all the income and everyone else has none. In that case, the curve would be at \(y = 0\%\) for all \(x < 100\%\), and \(y = 100\%\) when \(x = 100\%\). This curve is called the “line of perfect inequality”.
Definition and calculation
The Lorenz curve can usually be represented by a function \(L(F)\), where \(F\), the cumulative portion of the population, is represented by the horizontal axis, and \(L\), the cumulative portion of the total wealth or income, is represented by the vertical axis.
The curve \(L\) need not be a smoothly increasing function of \(F\). For wealth distributions there may be oligarchies or people with negative wealth for instance.
- Discrete distribution
For a discrete distribution of \(Y\) given by values \(y_1, ..., y_n\) in non-decresing order (\(y_i \le y_{i+1}\)) and their probabilities \(f(y_i) := Pr(Y = y_i)\), the Lorenz curve is a continuous piecewise linear function connecting the points \((F_i, L_i)\) for \(i=1\) to \(n\), where \(F_0 = 0, L_0 = 0\).
\[
\begin{align}
F_i &:= \sum_{j=1}^{i} f(y_j) \\
S_i &:= \sum_{j=1}^{i} y_i f(y_i) \\
S &:= \sum_{i=1}^{n} y_i f(y_i) \\
L_i &:= \frac{S_i}{S}
\end{align}
\]
- Continuous distribution
For a continuous distribution with the PDF \(f\) and the CDF \(F\), the Lorenz curve \(L\) is given by
\[
L(F(x)) = \frac{\int_{-\infty}^{x} t f(t) dt}{\int_{-\infty}^\infty t f(t) dt} = \frac{\int_{-\infty}^{x} t f(t) dt}{\mu}
\]
where \(\mu\) denotes the average.
The Lorenz curve \(L(F)\) may then be plotted as a function parametric in \(x\): \(L(x) \text{ vs. } F(x)\).